(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
3 July 2003 (03.07.2003) 




PCT 



(10) International Publication Number 

wo 03/054475 A2 



(51) International Patent ClassificatioD^: GOIB 1 1/06, 
11/24, 1 1/30. COIN 21/21. IIOIL 21/66. G03F 7/20 

(21) International Application Number: PCT/US02/4 1151 

(22) International Filing Date: 

19 December 2002 (19.12.2002) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/343,077 
Not furnished 



19 December 2001 (19.12.2001) US 
19 December 2002 (19.12.2002) US 



(71) Applicant: KLA-TENCOR CORPORATION [USAJS], 
160 Rio Robles. San Jose, CA 95134-1809 (US). 

(72) Inventors: SHCHEGROV» Andrei, V.; 325 Union 
Avenue. Apt. 359, Campbell, CA 95008 (US). FAB- 
RIKANT, Anatoly; 2245 Olive Avenue. Fremont, CA 
94539 (US). NIKOONAHAD, Mehrdad; 271 Oakhurst 
Place, Menlo Park, CA 94025 (US). LEVY. Ady; 1621 
Samedra Street, Sunnyvale. CA 94087 (US). WACK, 
Daniel, C; 930 Lundy Unc, Los Altos, CA 92024 (US). 



BAREKET, Noah; 12344 Saraglen Drive, Saratoga. CA 
95070 (US). IVQEHER, Walter; 1750 Nantucket Circle. 
Apt. 332. Santa Clara, CA 95054 (US). DZIURA, Ted; 
3664 Tunis Avenue, San Jose, CA 95132 (US). 

(74) Agents: HSUE, James, S. et al.; Parsons Hsue & de Runtz 
LLP, 655 Montgomery Street, Suite 1800, San Francisco, 
CA 94111 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT. AU, 
AZ, BA. BB. BG. BR, BY, BZ. CA, CII, CN, CO, CR. CU. 
CZ, DE, DK, DM. DZ, EC, EE. ES. H, GB. GD, GE, GH, 
GM, HR, HU, ID, tL, IN. IS, JP. KE. KG, KP, KR, KZ, LC, 
LK, LR, I^, LT, LU, LV. MA, MD, MG. MK, MN. MW. 
MX, MZ, NO, NZ, CM, PH, PL, PT, RO. RU, SC, SD, SE, 
SG. SK. SL, TJ, TM, TN, TR, TT, TZ, UA, UG. UZ, VC, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE. I^, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY. KG, KZ. MD, RU, TJ, TM), 
European patent (AT, BE, BG. CH, CY, CZ, DE, DK, EE, 
ES, FI, FR. GB. GR, IE, FT. LU, MC, NL, PT, SE. SI, SK, 
TR), OAPl patent (BF. BJ, CF, CG. CI, CM, GA, GN, GQ. 
GW. Ml., MR, NE, SN, TD. TG). 

[Continued on next page] 



= (54) Title: PARAMETRIC PROFILING USING OPTICAL SPECTROSCOPIC SYSTEMS 



< 

in 

IT) 
O 

o 

o 




(57) Abstract: A gallery of seed profiles is constructed and the initial parameter values associated with the profiles arc selected using 
manufacturing process knowledge of semiconductor devices. Manufacturing process knowledge may also be used to select the best 
seed profile and the best set of initial parameter values as the starting point of an optimization process whereby data associated with 
parameter values of the profile predicted by a model is compared to measured data in order to arrive at values of the parameters. Film 
layers over or under the periodic structure may also be taken into account. Different radiation parameters such as the reflectivities Rs, 
Rp and ellipsomctric parameters may be used in measuring the diffracting structures and the associated films. Some of the radiation 
pararaelens may be more sensitive to a change in the parameter value of the profile or of the films then other radiation parameters. One 
or more radiation parameters that are more sensitive to such changes may be selected in the above-described optimization process 
to arrive at a more accurate measurement. The above -described techniques may be supplied to a track/stepper and etcher to control 
the lithographic and etching processes in order to compensate for any errors in the profile parameters. 
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PARAMETRIC PROFILING USING OPTICAL SPECTROSCOPIC SYSTEMS 

Inventors: Andrei V. Shchegrov, Anatoly Fabrikant, Mehrdad Nikoonahad 
Ady Levy, Daniel C. Wack, Noah Bareket, Walter Mieher, and Ted Dziura 



CROSS REFERENCE TO RELATED APPLICATION 

[OOOIJ This application is a continuation of U.S. Provisional Application No. 
60/343,077, jSled December 19, 2001, which is incorporated herein by reference in its 
entirety. 

BACKGROUND OF THE INVENTION 

[0002] This invention relates in general to systems for finding profiles of 
topographical features of small dimensions, such as those of a diffracting grating, and in 
particular to such systems using optical spectroscopic techniques. 

[0003] As the integration density and speed of microelectronic devices increase, 
circuit structures continue to shrink in dimension size and to improve rn terms of profile 
edge sharpness. The fabrication of state-of-the-art devices requires a considerable 
number of process stq)s. It is becoming increasingly important to have an accurate 
measurement of submicron linev/idth and quantitative description of the profile of the 
etched structiires on a pattern wafer at each process step. Furthermore, there is a growing 
need for wafer process monitoring and close-loop control such as focus-exposure control 
in photolithography. 

[0004] Spectroscopic diffraction-based techniques are especially well suited for 
microelectronics metrology applications because they are nondestructive, sufficiently 
accurate, repeatable, rapid, simple and inexpensive relative to critical dimension-scaiming 
electron microscopy. In such diffraction-based analysis techniques, typically a model of 
the profile is first constructed, where the model includes a number of parameters that can 
be varied. One or more diffraction intensity versus \vavelength curves are calculated 
based on the model constructed and the curve(s) are compared with measured diffraction 
data from the sample. The parameters are then adjusted until a match is found between 
the curve(s) and the measured data. 
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[0005J The current methods being used include multi-slab models where a number of 
rectangular or trapezoidal slabs are put on top of one another to form a seed profile that is 
an approximation of the profile being measured. The parameters that can be adjusted 
include width and height of the rectangles or width, height and sidewall angle of the 
trapezoids. It is found that in the wafer processing processes, a number of very different 
profiles of structures may be encountered. The ciurent mefliods are inadequate for 
measuring a wide variety of very different profiles in the manufacturing process. A 
simple increase of the number of slabs to model such variety of profiles requires the 
generation of huge libraries whose size grows exponentially with the number of slabs and 
the associated parameters. Furthermore, different sets of parameters, corresponding to 
different profiles, can produce indistinguishable spectroscopic data, resulting in a 
problem known as cross-correlation. 

[0006] In U.S. Patent No. 5,963,329, Conrad et al. proposed an improved method to 
measure actual profiles. In this model, the number of independent parameters or 
variables is reduced by adopting particular profile shapes such as a "S" line profile, by 
dividing the model line profile into two or more sub-profilies and providing a numerical 
model of each sub-profile so that fewer scaling factors may be used to adjust all slab 
widths and heights within the single sub-profile. 

[0007] While the above-described method of Conrad et al. reduces the number of 
parameters that one needs to contend with, this method still has some drawbacks. Thus, it 
caimot be used for measuring line profiles made of more than material, and for measuring 
optical parameters as well as geometric parameters. It is therefore desirable to provide an 
improved model that can be used for determining the above mentioned samples in a 
maimer so that the solution converges to a single solution without a high risk of cross- 
correlation. 

[0008] As noted above, the shapes of line profiles encountered on semiconductor 
wafers during fabrication can take on a wide variety of shapes. Such line profiles are 
typically situated on and/or below layers of materials Vhich may be the same as or 
different from the material of the profiles. When diffraction-based spectroscopic 
techniques are used to measure such profiles, the radiation used in the technique would 
interact with the one or more layers and transmitted or reflected radiation from the layers 
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is detected by the detectors that are used for detecting radiation from the line profile. 
Where it is not possible or very difficult to separate the contribution of the signal due to 
the layers from the contribution of the signal due to the line profile, it is desirable for any 
technique used to measure the parameters of such layers simultaneously with 
measurement of the Une profile. None of the existing techniques has such capability. It 
is therefore desirable to provide an improved system where the contribution of such 
layers to the detector signal can be taken into account. 

[0009] Currently in the market the common methods of determining a profile (cross 
section) of a structure are: scanning electron microscopy or SEM (cross section and top 
down), atomic force microscopy or AFM, and scatterometry. For production monitoring 
scatterometry is being established as the leading method for lot by lot monitoring using 
periodic test targets. 

[0010] The basic methodology in scatterometry is the comparison of the measured 
(typically spectral) data to a library that has been prepared in advance and contains the 
possible variations of the target profile AND underlying layers. However, in many 
situations the mmiber of variables (e.g. underlying layer thickness in a damascene layer) 
is prohibitively large and therefore prevent the user fi-om creating a library. 

[0011] U.S. Patent 5,963,329 describes the use of a real time regression algorithm for 
the determination of the grating profile using its measured spectral reflected intensity. A 
major difficulty with this algorithm is that the regression time becomes prohibitive for 
more than 4 degress of freedom (floating parameters such as the CD, side wall angle and 
underlying film thickness). This prevents the user from using this methodology for the 
measurements of damascene structures or even photo-resist on complex/variable films. In 
addition, the added number of degrees of freedom results in a non-robust root 
convergence that will tend to lock onto local correlated minima. 

[0012] The major disadvantages of the above methods are as follows. It is difficult to 
create a Hbrary for gratings on multi variable films (e.^. photo resist on damascene layer 
or etched trenches or vias in inter-metal dielectrics). It is also difficult to regress in real 
time on more than 4 floating profile and film variables. It is therefore desirable to 
provide an improved system to alleviate such problems. 
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SUMMARY OF THE INVENTION 

[0013] Semiconductor devices are fabricated by processing equipment with certain 
set parameters of the manufacturing process, such as the time, temperature, focus and 
exposure dose in the Uthography and other parameters, such as the time and temperature 
for the deposition of certain layers, or the time, and nature of etching processes. Once 
these parameters are known, it is possible to simulate the profile of the structures that will 
result from such manufacturing process. A gallery of seed profiles or profile types may 
be used as possible starting points for finding the actual shapes of line profiles. 
Preferably, knowledge of manufacturing process parameters may be utilized in the 
construction of a gallery of profile types from which a particular profile type can be 
chosen for matching with the measured data. Also preferably, knowledge of 
manufacturing process parameters is utilized to select from the gallery a particular profile 
type that would serve as the best seed profile for the purpose of finding the actual profile 
ofstmctures. 

[0014] As noted above, the diffracting structure to be measured is frequently located 
on and/or below one or more layers of the same or different material, so that the detector 
employed would detect radiation influenced by such layers as well as diffraction fix)m the 
diffracting structure. These layers would have to be taken into account in the model. 
Parameters such as thickness and index of refraction (n and k) of these layers would be 
more sensitive to certain measurement parameters than others. This is also true of the 
parameters characterizing the diffracting structure. Therefore, in another embodiment of 
the invention, more than one set of radiation data may be generated from each profile 
type, where the sets of radiation data generated are of different radiation parameters, such 
as reflectance or transmittance parameters and ellipsometric parameters. For a given 
change in the parameter of the profile type (e.g., width, height, sidewall angle, index of 
refraction of the diffracting structure and thickness and index of refiraction of the one or 
more layers) may be more sensitive to the ellipsometric parameters than to the 
transmittance or reflectance parameters, or vice versa. In such event, it may be desirable 
to choose the set of radiation data and the associated radiation parameters that are more 
sensitive to a change in the parameter of the profile or a characteristic of the one or more 
layers to improve the accuracy and precision of the modeling and matching algorithm. 
This feature can also be used where the effects of the layers need not be taken into 
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account, such as where the effects are known, can be ignored or where there is no layer 
associated with the structure. 

[001 5] Independent of the above considerations, reflectance or transmittance 
parameters and ellipsometric parameters of the collected radiation may be used together 
for deriving one or more parameters of a profile with arbitrary shape. 

[0016] The gallery of profile types may be stored in a database made available to 
users and an optional processor may be used to select the profile type firom the gallery 
and compare the detected measured data to that associated with the selected profile type 
to arrive at a set of values of the one or more parameters of the profile type. 

[0017] Where the profiles measured are useful for controlling a wafer manufacturing 
process, the measured information may be used to control the processing system for 
adjusting one or more processing parameters. Thus, if the profile of the structure 
measured indicates a problem in the processing system, the processmg system may be 
adjusted to reduce or eliminate the effects of the problem. Any one of the above- 
described techniques may be used to find a profile of a structure and/or characteristics of 
one or more layers in the vicinity of the structure, and these values may then be supplied 
to a semiconductor wafer processing machine, such as a track, stepper and/or etcher, to 
control the lithographic and/or etching process in order to compensate for any errors in 
one or more parameters of the profile that has been discovered. The track, stepper and/or 
etcher may form a single tool with a system for finding the one or more parameters of a 
profile, or may be instruments separate firom it. 

[0018] To reduce the complexity in determining parameters such as the critical 
dimension, side wall angle and thickness of scattering and diffracting structures and of 
the properties of film stacks above and/or below the scattering and diffracting structures, 
multiple measurements may be combined. Thus, in order to simplify the method for 
determining one or more parameters of a diffracting structure, a reference structure may 
be measured where the reference stmcture comprises at least one layer that has 
substantially the same thickness as tlie diffracting structure, and/or comprises a material 
having substantially the same optical properties as those of a material in the diffracting 
structure. Information so obtained concerning the reference structure may then be used to 
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simplify the determination of the parameters of the diffracting structure. 

[00191 Where the diffracting structure is located adjacent to one or more films, ttie 
reference structure may also be located adjacent to a film structure than contains one or 
more layers whose properties are similar to those of one or more films adjacent to the 
diffracting structure. Therefore, by measuring the reference structure together with the 
properties of layers adjacent to it, the information so obtained on the properties of the 
layer(s) with similar properties can be used for determining the parameter values of the 
diffracting structure. 

[0020] The reference structure may be a smooth or diffracting structure on the same 
sample as the diffracting structure and the films associated with the reference structure 
may be formed in the same processing steps as the film structure adjacent to the 
diffracting structure, so that both the diffracting and reference structures are situated 
adjacent to two different fikn stacks with the same properties. 

[00211 Where two diffracting structures are present on the same sample, or on 
different samples of the same lots made by the same processing steps, to simplify the 
profile measurement of the structures, the profile obtained from one diffracting structure 
may be used as the seed profile in an optimization process for determining the parameters 
of another diffracting structure. 

[0022] Information obtained by a scatterometric measurement may be fed to a critical 
dimension-scanning electron microscope measurement (CD-SEM) on the same target, or 
a different target on the same wafer or on a different wafer in the same lot made by the 
same processing process. The use of scatterometric data would help eliminate 
uncertainty that exists in scanning electron microscope algorithms with respect to the wall 
angle dependence of critical dimension measurements, and provides an absolute 
calibration of the pitch for the scaiming electron microscope elements. 

[0023] Scatterometric measurement data may also be fed to an overlay measurement 
tool on the same target or a different target on the same wafer, or on a different wafer but 
in the same lot produced by the same process. The use of scatterometric data would 
assist in eliminating the uncertainty that would exist in overlay algorithms caused by the 
profiles of the target. 

-6- 
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[0024] In some measurements, it may not be possible to obtain the properties of film 
stacks over and/or below the diffracting structure. In such event, it may be desirable to 
determine the profile of the structure by using only a portion of the collected data, or the 
data of one of the parameters associated with the diffracting structure, where the 
influence of the film stack on measurements of the diffiacting structure is minimized. In 
other words, the subset of data of the parameters and/or the parameter(s) used in the 
optunization process would be selected to minimize the effect of the films stack on the 
optimization process. 

[0025] While the above-described features may be implemented as a stand-alone 
system and integrated with optical equipment for carrying out the measurements, it is 
possible for existing optical measurement equipment to be modified or otherwise enabled 
so that it has the capability described above. Thus, the above-described features may be 
embodied as a program of instructions executable by computer to perform the above- 
described different aspects of the invention. Hence any of the techniques described above 
may be performed by means of software components loaded into a computer or any other 
information appliance or digital device. When so enabled, the computer, appliance or 
device may then perform the above-described techniques to assist the finding of value(s) 
of the one or more parameters usmg measured data from a diffracting stmcture and/or the 
associated one or more layers. The software component may be loaded from a fixed 
media or accessed through a communication medium such as the internet or any other 
type of computer network. 

[0026] Each of the inventive features described above may be used individually or in 
combination in different arrangements. All such combinations and variations are within 
the scope of the invention. 

BRIEF DESCRIPTf ON OF THE DRAWINGS 

[0027] Fig. I A is a schematic view of a spectroscopic measurement device usefiil for 
illustrating the invention. 

[0028] Fig. 1 B is a cross-sectional view of a two-dimensional grating and associated 
layers usefiil for illustrating the invention. 



[0029] Fig. 2 is a schematic view of another spectroscopic measurement device useful 

-7- 
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for illustrating the invention. 

[0030] Figs. 3 A, 3B, 3C are cross-sectional views of two-dimensional structures 
encountered in semiconductor manufacturing useful for illustrating the invention. 

[0031] Fig. 3D is a perspective view of a three dimensional periodic structure with 
via holes useful for illustrating the invention. 

[0032] Figs. 4A-4F are sample profiles to illustrate a gallery of profile types or 
models to illustrate an embodiment of the invention. 

[0033] Fig. 5A is a flow chart of profile and film measurement to illustrate an 
embodiment of the invention. 

[0034] Fig. 5B is a flow chart illustrating in more detail the diffraction solver in the 
flow chart of Fig. 5 A. 

[0035] Fig. 6 A is a flow chart illustrating the selection of the optimum profile type or 
model and the value of parameters for the initial seat values. 

[0036] Fig. 6B is a flow chart illustrating the process for selecting the optimal 
radiation parameter and the corresponding set of radiation data for matching with 
measured data to illustrate one aspect of the invention. 

[0037] Fig. 6C is a schematic diagram illustrating the selection of the starting point 
for nonlinear optimization firom a course library to illustrate an aspect of flie invention. 

[0038] Fig. 7 is a schematic block diagram illustrating a wafer processing apparatus 
including a track/stepper and an etcher and a spectroscopic measurement device where 
information from a diffracting structure and/or associated structures from the device as 
used to control the manufacturing process and the track, stepper and/or etcher to illustrate 
the invention. 

[0039] Fig. 8 is a schematic block diagram illustrating in more detail the track/stepper 
of Fig. 7. 



[0040] Figs. 9A, 93, 9C are schematic views of sample structures useful for 
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illustrating different embodiments of the invention for determining the profile or 
parameters of a diffracting structure. 

[0041] Fig. 10 illustrates an embodiment where data obtained from a scatterometric 
measurement tool is fed forward to a CD-SEM measurement or overlay tool performing a 
measurement on the same target or a target with some common features. 

[0042] Fig. 1 1 is a schematic view of an overlay tool measuring ttie offset between 
two gratings to illustrate a feature of the invention of Fig. 10. 

[0043] Fig. 1 2 is a one-dimensional image of a box-in-box type target to illustrate a 
featureof the invention of Fig, 10. 

[0044] Fig. 13 is a block diagram showing a representative sample logic device in 
which aspects of the present invention may be embodied. 

. [0045] For simplicity and description, identical components are labeled by the same 
numerals in this application. 

DETAILED DESCRIPTION OF THE EMBODIMENTS 

[0046] Even though much of the description below of algorithms and methods are 
described in terms of the reflected or transmitted intensities of the diffraction caused by 
the diffracting structure, it will be understood that the same techniques and algorithms 
may be used for data containing information concerning changes in the polarization state 
over different wavelengths (e.g. ellipsometric parameters A and ^ as functions of 
wavelength). For this reason, it may be advantageous to employ an instmment which is 
capable of measuring both the reflected or transmitted intensities of the diffraction caused 
by the structure as well as changes in polarization state caused by the diffraction of the 
structure. A suitable system is described below in reference to Fig. 1 A. 

[0047] Fig. 1 A is a schematic view of a spectroscopic diffraction-based metrology 
system to illustrate the preferred embodiment of the invention. As shown in Fig. 1 A, 
system 10 may be used to measure reflected or transmitted intensities or changes in 
polarization slates of the diffraction. As shown in Fig. 1 A, a semiconductor wafer 1 1 
may comprise a silicon substrate 12, and a structure 16 thereon that may include a 
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photoresist pattern on and/or over film stack(s), where the film(s) are at least partially 
light-transmissive and has a certain film thickness and refractive index (n and k, the real 
and imaginary components of the index). 

[00481 An XYZ stage 14 is used for moving the wafer in the horizontal XY 
directions. Stage 14 may also be used to adjust the z height of the wafer 1 1. A 
polychromatic or broadband radiation source such as white light source 22 supplies Ught 
through a fiber optic cable 24 which randomizes the polaiization and creates a uniform 
Ught source for illuminating the wafer. Preferably, source 22 supplies electromagnetic 
radiation having wavelengths in the range of at least 180 to 800 nm. Upon emerging 
from fiber 24, the radiation passes through an optical illuminator 26 that may include an 
aperture and a focusing lens or minor (not shown). The aperture causes the emerging 
light beam to illuminate an area of stmcture 16. The light emerging firom iUuminator 26 
is polarized by a polarizer 28 to produce a polarized sampling beam 30 iUuminating the 
structure 16. 

[00491 The radiation originating from sampling beam 30 is reflected by stmcture 16, 
passed through an analyzer 32 and to a spectrometer 34 to detect different spectral 
components of the reflected radiation, such as those in the spectrum of the radiation 
source 22, to obtain a signature of the structure. In one mode (spectrophotometry mode) 
of operation, the reflected intensities are then used in a manner described below to find 
the value(s) of one or more parameters of structure 16. The system 10 can also be 
modified by placing the spectrometer 34 on the side of stmcture 16 opposite to 
illumination beam 30 to measure the intensities of radiation transmitted ferough 
stmcture 16 instead for the same purpose. These reflected or transmitted intoisity 
components are supplied to computer 40. 

[00501 Alternatively, the light reflected by the structure 1 6 is collected by lens 54, 
and passes through the beam splitter 52 to a spectrometer 60. The spectral components at 
different wavelengths measured are detected and signals representing such components 
are supplied to computer 40. The Ught reflected by structure 16 may be supplied by 
source 22 through illuminator 26 as described above or through other optical components 
in another arrangement. Thus, in such anrangement, lens 23 collects and directs radiation 
from source 22 to a beam splitter 52, which reflects part of the incoming beam towards 

-10- 
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the focus lens 54 which focuses the radiation to structure 16. The light reflected by the 
structure 16 is collected by lens 54, passes through the beam splitter 52 to 
spectrometer 60. 

[0051] When the system 1 0 is operated in another mode (spectroscopic ellipsometry 
mode) used to measure the changes in polarization state caused by the dif&action by the 
structure, either the polarizer 28 or the analyzer 32 is rotated (to cause relative rotational 
motion between the polarizer and the analyzer) when spectrometer 34 is detecting the 
diffracted radiation from structure 16 at a plurality of wavelengths, such as tiiose in tiie 
spectrum of the radiation source 22, where the rotation is controlled (not shown) by 
computer 40 in a manner known to those skilled in the art The diffracted intensities at 
different wavelengths detected are suppUed to computer 40, which derives the changes in 
polarization state data at different wavelengths from the intensities in a manner known to 
those in the art See for example U.S. patent No. 5,608,526, which is incorporated herein 
by reference, 

[0052] Fig. IB is a cross-sectional view of the structure 16 on substrate 12, which 
structure comprises a diffracting structure 16b situated between the film stack 16a above 
the structure and the film stack 16c underneath the structure and an mcident 
electromagnetic beam 30 to illustrate the invention. Thus, the incident beam 30 of the 
electromagnetic radiation first encounters the interface between the air and the fihn 
stack 16a and interfaces that may be present within the stack. Next the portion of the 
radiation from beam 30 that penetrates the film stack 16a is diffracted by the grating 
structure 16b. At least some of the radiation from beam 30 will reach the fihn stack 16c 
underneath the grating and be reflected by or transmitted through interfaces associated 
with stack 16c. The total light reflectance is affected botii by tiie grating and by the fihn 
stacks above and/or below the grating. Multi-layer interference, caused by multiple 
reflections between the fihns and the grating, creates a complicated pattern in a 
reflectance spectrum, which can be used for measuring parameters of the shucture. A 
part of radiation from beam 30 tiiat is not reflected or diffiracted as described above will 
be transmitted into the substrate 12. As shown in Fig. IB, the grating 16b has a height of 
H, a critical dimension ("CD") and a sidewall angle (SWA) as indicated. 

[0053] Fig. 2 is a schematic view of an alternative spectroscopic measurement system 

-11- 
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80 to illustrate the invention. The system of Fig. 2 differs from that in Fig. 1 A in that it 
uses the same optical components for both the spectrophotometry mode measurement as 
well as the ellipsometry measurement, and thus has fewer optical components. On the 
other hand, the two modes need to be employed sequentially and not simultaneously as is 
possible with the apparatus of Fig. lA. As before, where there is relative rotational 
motion between the polarizer 28 and analyzer 32 when a measurement is taken, the 
system 80 of Fig. 2 operates as an ellipsometer. This can be achieved by rotating either 
the polarizer 28 or the analyzer 32, or both. Where there is no relative rotation between 
polarizer 28 and analyzer 32 (such as where both did not rotate, or rotate at the same 
speed), instrument 80 operates as a spectrophotometer or reflectometer. 

[0054] As shown in Fig, 2, system 80 further includes a beam divider 82 which 
diverts the portion of the illumination beam from source 22 to a spectrometer 84 which 
measures variations in the intensity of the illumination beam so that the effects of such 
variations may be removed from the measurements. Beam shaping optics 86 is employed 
to shape the illumination beam, such as by collimating or focusing the beam. 

[00551 While some diffracting structures may take on simple geometric shapes such 
as that illustrated in Fig. IB, in some instances, these structures can take on more 
complex shapes. When this is the case, it is desirable to provide a model by which a 
much wider variety of profiles of structures can be predicted than can conventional 
models. Figs. 3 A-3D illustrate the type of structures that may be encountered during the 
wafer manufacturing process. Fig. 3A is a cross-sectional view of a line grating on top of 
a film stack, where the cross-section of each line is in the shape of a trapezoid 92, and the 
fihn stack comprises layers 94a (bottom anti-reflection coating, or BARC), 94b 
(polysilicon), 94c (silicon dioxide) on top of a substrate 12. 

[0056] Alternatively, the structure may comprise periodic lines where each line 
comprises a stack of several different materials, where the cross-sectional shape of the 
lines is curved. As illustrated in Fig. 3B., the diffiacting structure comprises three layers: 
96a, 96b, 96c and the diffracting stmcture is located on top of a fihn 94 which may 
comprise one or more layers. The structure in Fig. 38 typically results from the process 
of shallow trench isolation (^'STl"). Yet another example of a realistic structure 
encountered in wafer manufacturing is illustrated in Fig. 3C which comprises a line 
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grating with sidewall spacers, made of a material different form that of the line grating. 
As shown in Fig. 3C, each line grating comprises a center portion 98a which is 
substantially rectangular in cross-section and two sidewalk 98a, 98b on the two sides of 
the rectangle where the line structures are situated on top of a fihn 94. The sidewall 
spacers of Fig. 3C are typically used to control the desired shape of polysilicon lines 98 in 
the process of reactive ion etching ("RIE"). 

[0057] Fig. 3D is a perspective view of a periodic structure with via holes where the 
holes may penetrate one or more layers. The via holes provide vertical connections from 
one metallization layer to another. Thus, the structure 16 of Figs. 1 A, 2 may include one 
or more of the diffracting structures and layers shown in Fig. IB, 3A-3D. From the 
shapes of the structures illustrated in Figs. 3 A-3D, it will be evident that prior art 
methods, such as the one described in U.S. Patent 5,963,329, may be inadequate for 
measuring the more complex structures illustrated in such figures. 

[0058] Fig. 4A-4F illustrate examples of profile models which may advantageously 
serve as the seed profiles or profile types that may be employed to derive the actual 
profile of a diffraction structure encountered in semiconductor manufacturing. Fig. 4A 
illustrates a cross-sectional view of a profile type comprising a single-material, multi- 
trapezoid profile, characterized by values of CD, height and sidewall angle for each 
trapezoid. When the sidewall angles are fixed at 90 degrees, this profile type becomes a 
multi-slab model. The bottom trapezoid models a footer. 

[0059] Fig. 4B is a cross-sectional view of a single-material, quartic profile which 

may be represented by the polynomial expression y = ax'*, characterized by the height of 

the profile and coefficient value a. Fig. 4C is a cross-sectional view of a single-material, 

quartic profile with a bottom rounding (i.e., rounded footer), characterized by height, 

quartic coefficient a and parameters of the bottom rounding or footer. More than one 

model may be used for the footer, one example being a model using a smooth fimction 

(e.g. straight line as shown in Fig. 4A, or curved line such as that of a quadratic fimction). 

Fig. 4D is a cross-sectional view of a multi-material, etched quartic profile of the form 

y = ax"*, characterized by coefficient a and the thicknesses of each of the three layers. 

Fig. 4E is a cross-sectional view of a two-material profile with sidewall spacers, 

characterized by height, edge and profiling paor^eters for the inner and outer materials of 
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the spacers, such as common height value for the inner and outer materials, different CD 
values for the inner and outer materials, and a sidewall angle for the outer material. 
Fig. 4F is a perspective view of a three-dimensional structure with the via hole profile in 
a uniform layer, characterized by the height and hole parameters (radius for a circular 
hole). 

[0060] While the quartic profile is illustrated in Figs. 4B, 4C and 4D, profiles that can 
be described using other polynomial expressions may also be used and are within the 
scope of the invention, such as quadratic parabolas, or a combination of quartic and 
quadratic parabolas. In the same vein, while the profile type in Fig. 4A includes multiple 
slabs that are trapezoidal, slabs defined by one or more analytical fimctions, such as 
where one side of the trapezoid is curved, may be employed and are within the scope of 
the invention. 

[0061] The profile types in Figs. 4A-4F do not include layers of material which may 
lie above and/or below the actual diffracting structures measured. These layers can also 
be modeled as described below using parameters for such layers, such as thicknesses and 
indices of refraction (referred to herein also as film parameters), so that the models 
constructed using the profile types can take into account the layers above and/or below 
the diffracting structures measured. In addition, the profile types themselves may include 
layers, such as those illustrated in Fig. 4D. The layers of the profile type in Fig. 4A can 
be modeled using not only geometric parameters, such as the coefficient a and height of 
each of the three layers, but also the complex index of refraction of each of the materials 
in the three layers of the profile itself. 

[0062] Before any measurement of structure 16 is made using the apparatuses in 

Figs. 1 A and 2, a gallery of profile types such as those illustrated in Figs. 4A-4F is first 

prepared and stored in the database. Fig. 5 A is a flowchart of profile and film 

measurement to illustrate a process using a model to measure the parameters of the 

diffracting structure. Where the structure is situated on and/or below one or more layers 

of materials, the model may also be used to measure one or more parameters of such 

layers. As shown in Fig. 5A, an off-line pre-processing tool 102 is used to provide the 

gallery described above together with the seed profile and film parameters associated 

with each of the profile types, such as the profile types illustrated in Figs. 4A-4F, together 
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with layers over and below the profiles shown. The profile parameters can include, for 
example, CD, height, sidewall angle, parameters associated with polynomial expressions 
such as the coefficient a and height of quartic profiles, parameters of the bottom rounding 
and of the spacers, and the indices of refiraction (n and k) parameters of materials of the 
line profile. The film parameters may include thicknesses of the layers and the indices of 
refiraction (n and k). 

[0063] Tool 1 02 then computes from the profile types and their associated profile and 
fihn parameters, as well as initial values of such parameters (e.g. based on estimation, or 
the knowledge or simulation of the fabrication process), predicted spectra radiation data 
associated therewith in a diflSraction solver 108. The operation of the diffraction 
solver 108 is illustrated in more detail in Fig. 5B. As shown in Fig. 5B, the profile type 
may be approximated by slabs (block 1 10). Eignvalues and S-matrices for each slab and 
each film underneath and/or over the profile type are computed (block 112). S-matrices 
are then propagated (block 114) to arrive at a spectrum (block 116), which is the 
predicted radiation data when a profile is measured using the instruments of Figs. 1 A, 2. 
For a detailed description of the modeling process applied by solver 108, please see the 
references below: 

M. G. Moharam, E. B. Grann, D. A. Pommet, and T, K. Gaylord, 
"Formulation of stable and efficient implementation of the rigorous 
coupled-wave analysis of binary gratings,'' J. Opt. Soc. Am. A, vol. 12, 
pp. 1068-1076 (1995); 

L. Li, "Formulation and comparison of two recursive matrix 
algorithms for modeling layered diffraction gratings," J. Opt. Soc. Am. A, 
vol. 13, pp. 1024-1035 (1996); and 

M. G. Moharam, "Coupled-wave analysis of Two-Dimensional 
Dielectric Gratings," PROC. SPIE, vol. 883, pp. 8-11 (1988). 
[0064] Returning now to Fig. 5 A, the spectra associated with the actual diffracting 
structure and the film(s) in structure 16 are then measured using the apparatus of either 
Fig. I A or Fig. 2 (block 1 20) or any other suitable apparatus and the measured data is 
then compared with the predicted spectrum from the diffraction solver 108 (block 122). 
If diere is a good match between the two spectra, the initial values of the parameters of 
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the profile type and of the film(s) then correctly predict those of the actual structure and 
film(s) that are measured (block 124). If the match is less than satisfactory (block 126), 
the profile and film parameters (block 106) are then varied or adjusted by means of a 
nonlinear optimization tool (block 126) in a feed back path. The steps of the diffi-action 
solver 108 and the comparison 122 are repeated until there is a satisfactory match 
between the predicted spectrum and the experimental spectrum. Any number of 
nonlinear optimization tools may be employed, such as those described in the following 
articles: 

J. Nocedal and SJ. Wright, "Numerical Optimization," 
Springer-Verlag, New York, NY (1999); and 

D.T. Pham and D. Karaboga, "Intelligent Optimization 
Techniques: Genetic Algorithms, Tabu Search, Simulated 
Annealing and Neural Networks," Springer-Verlag, New York, NY 
(2000). 

[0065] As described above in reference to Figs. 3A-3D, the actual diffracting 
structures encountered in wafer processing include a wide variety of different shapes. 
According to one aspect of the invention, information concerning the manufacturing 
process may be advantageously used in selecting profile types for the gallery which serve 
as the seed profiles for the modeling process. Thus, the gallery of Figs. 4A-4F are 
selected keeping in mind the structures encountered in semiconductor manufacturing, 
such as those in Figs. 3 A-3D. 

[0066] As noted above, semiconductor devices are fabricated by processing 
equipment with certain set parameters of the manufacturing process, such as the time, 
temperature, focus and exposure dose in the lithography and other parameters for 
deposition of certain layers or of etching processes. Once these parameters are known, it 
is possible to derive the profile of the structures that will result fi'om such manufacturing 
process. A software tool that may be used to simulate the profile of the structures 
resulting from the manufacturing process is PROLITH™ simulator software, available 
from KLA-Tencor Corporation, the assignee of the present application, in San Jose, 
California. This software is described in Inside PROLITH, by Chris A. Mack, Finle 
Technologies (Austin, TX: 1 997). Another possible tool that may be used to simulate 
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the profile of the structures resulting from the manufacturing process is Solid_C, from 
Sigma_C, Munich, Germany. Thus, once information concerning the manufacturing 
process, such as the values of the manufacturing process parameters (e.g., time, 
temperature) is available, the profile that is predicted from such parameters may then be 
used to select a profile type from the gallery of profile types to serve as the seed profile 
for the modeling process illustrated in Figs. 5A, 5B. In addition, the predicted profile 
arrived at using manufacturing process information may also be used to select a set of 
initial values of the parameters associated with the profile type, such as initial values of 
CD, sidewall angle, height, coefficient a and height of quadric expressions or coefficients 
of other polynomial-type expressions, and a process window in which these parameters 
are expected to vary. This process is illustrated in Fig. 6A, 

[00671 As illustrated in Fig. 6A, a lithography simulator 240 (e.g. PROLITH 
simulator) simulates, from parameters of manufacturing process 242, a line profile 244. 
From the simulated line profile 244, the profile type of Fig. 4A in the gallery that is the 
closest match to line profile 244 is then selected as the seed profile. The line profile 244 
is also used to select uiitial values of the different parameters of such profile type, so that 
the predicted profile using such profile type is the closest match to simulated profile 244. 
Thus, in the example in Fig. 6A, initial values of the seven parameters CDi, CD2, CD3, 
CD4 and Hi, H2, H3 are selected for the modelmg process of Figs. 5 A, 5B so that the 
predicted profile 246 shown in Fig. 6A is the closest match to simulated profile 244. By 
makuig use of the manufacturing process as described above, a profile type with initial 
parameter values that is close to the actual structure being measured is selected as the 
seed profile for the modeling process, so that ttie non-linear optimization process 
illustrated in Fig. 5 A can converge rapidly. 

[00681 The above-described modeling process starting with tiie seed profile or profile 
type and with the initial parameter values is illustrated in Fig. 6B. As noted above, from 
information available from the manufacturing process, it is possible to ascertain a process 
window in which parameter values may vary. Such window is illustrated in Fig. 6B, 
where the ranges of the parameters 1 and 2 shown are the ones through which these two 
parameters may vary. The window may be defined with respect to a center point 250, 
and an amount that each parameter is allowed to deviate from the value at this center 
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point; the center point and the deviation allowed for each parameter are derived from the 
manufacturing information. The process window may be divided into different sections 
by a set of vertical lines and a set of horizontal lines, where the intersections between the 
two sets of lines form a set of points, each of which correspond to a pair of values for the 
two parameters. Solver 1 08 may be used to derive the radiation spectra corresponding to 
these pairs of values, where the spectra and their corresponding pairs form a coarse 
library. Where the profile type is characterized by more than two parameters, the window 
would be a space with more than two variables, and each intersection point would 
correspond to a set of more than two parameters. 

[0069] After this coarse library has been constructed, the spectra in the library are 
matched with the simulated data to find the closest match. The intersection point 252 
corresponding to the closest matching spectrum indicates the set of initial parameter 
values that is a good starting point to perform the optimization process of Fig. 5 A. The 
Une 254 in Fig. 6B illustrates schematically the path taken by the optimization process, 
arriving at the final result at point 260 in Fig. 6B. 

[0070] In the profile type of Fig. 4A, for example, there will be at least three 
parameter values: CD, height ("H") and sidewall angle ("SWA"), For some profile 
types, two parameters may be adequate, such as the quartic profile type of Fig. 4B, which 
may be characterized by the coefficient a and height of the profile. As noted above, the 
set of initial parameter values of the profile type selected is such that the predicted profile 
using the profile type from the gallery is the closest match to the simulated profile. Thxis, 
as noted above in reference to Fig. 5B, a spectrum or spectra of a radiation parameter 
over a range of wavelengths 1 16 is arrived at using the diffraction solver 108 which 
corresponds to the set of initial values 252 of the profile type selected and its associated 
films. This spectrum or spectra are then compared with the measured data as in block 
122 of Fig. 5 A and a non-linear optimization tool may be utilized as described to arrive 
after convergence, along path 254, at a final set 260 of parameter values of the profile 
type. If the profile type of Fig. 4A is selected, for example, the final set 260 would 
comprise the final values of the CD, height and sidewall angle of each trapezoid. 

[0071 1 In order to speed up the process described in reference to Fig. 5 A, a coarse 

library such as that indicated in Fig. 6B may be pre-computed off-line, so that each 
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profile type in the gallery is stored together with a number of sets of initial parameter 
values, such as those corresponding to the intersection points (e.g., 252) in the grid-like 
structure in Fig. 6B, and their corresponding spectra. The diffraction solver 108 is then 
used to compute the spectrum corresponding to each of the intersection points and such 
spectra are stored together with the profile type and the associated sets of initial 
parameter values at the intersection points. Then, when a simulated profile becomes 
available, such as simulated profile 244 in Fig. 6A, such simulated profile is then 
matched against the predicted profiles that correspond to the different sets of initial 
parameter values corresponding to the intersection points in Fig. 6B in the coarse library. 
From this comparison, a particular intersection point in the grid-like structure and the 
corresponding set of initial parameter values may be quickly identified and the process of 
blocks 120, 122, 124 and 126 of Fig. 5A may be carried out very quickly to locate the 
final set 260 of parameter values of the profile type. Thus, while the resolution of the 
coarse library of Fig. 6B is not sufficient for measurement, it provides significant 
acceleration of non-linear optimization. Where a coarse library is not constructed before 
hand, the center point 250, its conresponding set of parameter values and spectra, may.be 
used as the starting point for the optimization process in Fig. 5 A. 

[00721 The above-described radiation parameters may be measured in a manner 
known to those in the art, using the systems in Figs. 1 A and 2. Another aspect of the 
invention is based on the observation that certain radiation parameters may be more 
sensitive to the change in one or more parameters associated with the profile type and 
related films than other radiation parameters. This is illustrated in Fig. 6C. From the 
profile type, its associated film(s) and the initial values 252 of the parameters selected, 
the diffraction solver 108 generates predicted spectra of different radiation parameters. 
Shown as examples in Fig. 6C are the spectra 270 for four different radiation parameters 
that are so generated: R^, Rp, cosA and tanxj/. The different parameter values (e.g. CD, H, 
SWA) associated with the profile type are then varied and the diffraction solver 108 is 
used to generate a set of different spectra for each of the four or more different radiation 
parameters. By comparing the change in spectra of the four or more radiation parameters 
corresponding to the same variation in parameter value (e.g. CD, H, SWA), the radiation 
parameter and its conresponding spectra that is the most sensitive to the change in 
parameter value is then identified. 
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[0073] In other words, each of the selected profile types is varied. Thus, if the profile 
type of Fig. 4A is selected, then each of the parameters CD, H and SWA is varied. For 
each variation of each of the three parameters, diffraction solver 108 computes the 
corresponding spectrum for each of the four or more radiation parameters. A quantity 
may be defined by the equation: 



This quantity ( ) measures the difference between two sets of data jR, and , which 
can be, e.g!, the theoretical and the experimental values of a certain signal 
(/?„jRp,cosA,...). The values a„set the weight of the n-th data point and are typically 
defined by the experimental uncertainty. In calculating %^ in Fig.6C, actually two 
theoretical spectra are compared- one at the initial parameter values with the one at a 
modified parameter values. Quantities other than may also be used in optimization for 
example, the cross-correlation between the compared spectra may be optimized 

[0074] The quantity x^ along path 254,is thus computed according to the equation above, 
which is the difference between two theoretical spectra-one at the initial parameter 
values 252 and the one where one of the parameter values has been modified firom its 
initial value. In the four sets of spectra 270 shown in Fig. 6C, a number of curves are 
computed for each of the four radiation parameters, where each curve corresponds to the 
theoretical spectrum with one of the parameters having a value that is modified compared 
to the initial value. The four quantities of the four radiation parameters corresponding 
to the same modification in CD, H or SWA are then compared to identify the radiation 
parameter that is the most sensitive to a change in CD, H or SWA, and its spectra. In the 
example 274 shown in Fig. 6C, is the largest for the radiation parameter tan\|/. 
Therefore, if the radiation parameter tanvj/ is chosen for the modeling process shown in 
Fig. 5 A, a more accurate result may be achievable. In other words, when the apparatus of 
Fig. 1 A or 2 is used to measure the spectra associated with the diffracting structure and 
any associated films (block 120 in Fig. 5 A), the radiation parameter tan\|/ is measured 
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over a range of wavelengths, and such spectrum is then compared (block 122) to tany 
generated by the diffraction solver 108 in the flowchart of Fig. 5 A, to arrive at a more 
accurate set of values for the final set 260 of Fig. 6B. 

[0075] Fig. 6C illustrates four of the radiation parameters that may be used. A more 
complete list includes the following 12 radiation parameters: 

Rs.Rp>^. -^p.cosA, tan\(/ =|r/r, 1. 
R,/R^,X=\r,-r^tY=\r, + r,\\X/Y,(X-Y)KX+Y) 

2jR„oos^AjR,sm^A 

/y = _}L-£ ^ , — cos A 

^ R^cos^A'^-R.sixrA 

where r,and denote the complex amplitude reflection coefficients for S and P 
polarizations respectively, while R^md i?^are the reflectivities for S and P polarizations 
respectively: R,=\r,\\R,^r^f ^ The angle A is the analyzer angle and can be 
(optimally) set by hardware configuration. The quantities tan y and cos A are 
ellipsometric parameters known to those in the art. 

[0076] It will be noted that the process described in reference to Figg. 5A, 5B takes into 
account both profile and fihn parameters, so that the process described above in reference 
to Fig. 6C selects the radiation parameter and its associated spectra that is the most 
sensitive to a variation in a profile and/or fihn parameter. 

[0077] The advantages provided by different aspects of the process described above are 
set forth in the table below. 
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Figs. 6A, 6B, 6C. Off-line techniques developed and 
used in this invention that allow real-time measurement 
of profile and film slack parameters by the method of 
Figs. 5A, 5B 



Method 


Advantage 


Analysis of manufacturing 
process infonnatlon by a 
lithography simulation tool 
(Rg.6A). 


Optimal choice of profile model 
and process window for 
parameters 


Selection of one or more 
signals (spectra) that are most 
sensitive to parameters of 
Interest (Rg.6C) 


Ability to measure both profile 
and film parameters by 
selecting most sensitive signals 


Generation of a look-up table of 
eigenvalues in the grating 
region 


Replacement of the eigenvalue 
computation by interpolation to 
speed up the diffraction solver 
In the real-time measurement 


Generation of a coarse library 
of spectra within the process 
window (Fig.6B) 


Best initial seed to accelerate 
the convergence of nonlinear 
optimization 



[0078] Fig. 7 is a block diagram of an integrated spectroscopic dif&action-based 

metrology system , a photolithographic track/stepper and an etcher to illustrate another 

aspect of the invention. A layer of material such as photoresist is formed on the surface 

of a semiconductor wafer by means of track/stepper 350, where the photoresist forms a 

grating structure on the wafer. One or more of the CD, H, SWA and/or other parameters 

of the grating structure are then measured using systems 10, 80 of Fig, 1 A, 2 and one or 

more of the above-described techniques may be employed if desired to find the value(s) 

of the one or more parameters of the photoresist pattern and its associated film(s). Such 

value(s) from the computer 40 are then fed back to the track/stepper 350, where such 

information may be used to alter the lithographic process in track/stepper 350 to correct 

any errors. In semiconductor processing; after a layer of photoresist has been formed on 

the wafer, an etching process may be performed, such as by means oF etcher 360. The 

layer of photoresist is then removed in a manner known in the art and the resulting 

grating structure made of semiconductor material on the wafer may again be measured if 
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desired using system 10 or 80. The value(s) measured using any one or more of the 
above-described techniques may be supplied to the etcher for altering any one of the 
etching parameters in order to correct any errors that have been found using system 10 or 
80. Of course, the results obtained by one or more of the above described techniques in 
system 10, 80 may be used in both the track/stepper and the etcher, or in either the 
track/stepper or the etcher but not both. The track/stepper 350 and/or etcher 360 may 
form an integrated single tool witti the system 10 or 80 for finding the one or more 
parameters of a dif&acting structure, or may be separate instruments firom it. 

[0079) Fig. 8 is a schematic view of the track/stepper 350 and an associated flowchart 
illustrating a process for semiconductor wafer processing to illustrate in more detail the 
points of integration of the processing process with the detection of profiles of diffiracting 
structures and associated films to illustrate in more detail a part of the process in Fig. 7. 
As shown in Fig. 8, a semiconductor wafer 352 may be loaded firom a cassette loader 354 
to several stations labeled "prime," "coat," "soft bake," "EBR." Then the wafer 352 is 
delivered by a stepper interface 356 to exposure tool 358. The different processes at the 
four locations mentioned above are set forth below: 

[0080] At tlie location "Prime", the wafer undergoes chemical treatment before a 
layer of photoresist is spun on it, so that the photoresist layer can stick to wafer. At the 
location "Coat", a layer of photoresist coating is spun onto the wafer. At "Soft bake", the 
layer of resist is baked to remove chemical solvent firom the resist. At "EBR" which 
stands for "edge-bead removal", a solvent nozzle or laser is used to remove excess 
photoresist firom the edge of wafer. 

[0081] After the wafer has been exposed to radiation by tool 358, the wafer then 
undergoes four additional processes: "FEB," "FEB chill," "Develop," and "Hard bake." 
At "FEB or post exposure bake", the wafer is baked to reduce standing-wave effect from 
the exposure tool. Then it is cooled at "FEB chill". The wafer is then washed with 
reagent lo develop the photoresist, so that un-exposed (negative) or exposed (positive) 
photoresist is removed. The wafer then is baked at "Hard bake" to stabilize the 
photoresist pattern. It will be noted that all of the components of device 350 of Fig. 8 
except for the "exposure tool" 358 is known as the "Track" (also called cluster). 
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[0082] After these latter four processes have been completed, the wafer 352 is then 
returned to the cassette loader 354 and this completes the processmg involving the 
stepper 350. Tlie detection system 10 or 80 may be applied at arrow 362 to measure the 
parameters of the diffracting structure and associated film(s). Thus, such parameters may 
be measured after "hard bake." 

[0083] There are several ways of enabling a measurement of a profile using 
information coming firom a separate film or another profile measurement: 

a. Performing a film measurement to determme the thickness of underlying layers 
and feeding the information forward to a separate scatterometry measurement on a 
grating target of photo resist using regression. This reduces the number of 
variables in the regression, which speeds up significantly the calculation and 
improves the accuracy/robustness. 

b. Performing a film measurement to determine the n&k of underlying layers (e.g. 
the BARC layer in a gate ADI measurement) and feeding the information forward 
to a separate scatterometry measurement on a grating target of photo resist using 
regression. This reduces the number of variables in the regression, which speeds 
up significantly the calculation and eliminates errors due to incorrect optical 
constants in a library or regression schemes. 

c. Performing a fihn measurement to determine the thickness of underlying AND 
CURRENT etched layers (layers that are part of the grating structures)and feeding 
the information forward to a separate scatterometry measurement on a grating 
target of etched dielectric. This enables the measurements of damascene structures 
that otherwise would have too many degrees of freedom in the profile library or 
regression measurement. 

d. Perfonning a profile measurement using a scatterometry and feeding it forward as 
the seed to the next scatterometry regression measurement 

e. Performing a scatterometry measurements and feeding the information (most 
significantly - side wall angle and pitch) to a CD-SEM measurement on the same 
target or a different target on the same wafer. This eliminates the uncertainty that 
exist in SEM algorithms with respect to the wall angle dependence of CD 
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measurements. In addition, it gives the CD SEM an absolute calibration of the 
pitch. 

f. Performing a scatterometry measurements and feeding the information (most 
significantly - Top rounding and wall angle) to an overlay measurement on the 
same target or a different target on the same wafer (e.g. the zebra targets). This 
eliminates the uncertainty that exist in overlay algorithms with respect to wafer- 
induced shifts (as a results of wafer processing variations). 

g. Combine a multiple angle of illumination information either in a feed forward 
scheme or in a simultaneous regression to determine the profile parameters of a 
structure. For example the use of a normal or near normal reflectometer together 
with an oblique one to provide additional information on the structure (one or 
both can be an SE). 

h. Performing a profile measurement using scattrometry by regression on the 
reflected signal or the phase signals as collected by the metrology tool (typically a 
spectroscopic elUpsometer) 

i. Performing a regression to find the profile of a structure using a sub set of the * 
collected spectra, polarization or phase information that is less sensitive to underlying 
film properties. 

[0084] Figs. 9A, 9B, 9C are schematic views of sample structures useful for 

illustrating different embodiments of the invention for determining the profile or 
parameters of a diffracting structure. 

(0085) Thus, the fihn stack 360 as shown in Fig. 9A is measured at site 1 by 

means of a reflectometer, spectrophotometer or ellipsometer as illustrated in Figs. 1 A and 
2 to obtain the thicknesses and/or complex indices of refraction of the different layers in 
the film stack. The instruments in Figs. 1 A and 2 may also be used to measure intensity 
and phase information from the diffracting structure 362a on top of the fihn stack 360 at 
site 2. If the film thickness and indices of refraction information obtained by measuring 
the film stack 360 at site 1 is used in the calculations or modeling of the radiation data in 
measuring structure 362 at site 2, the number of variables in the regression or calculation 
will be drastically reduced. This significantly speeds up the calculation of the profile 
and/or parameters (e.g. critical dimension, side wall angle and thickness) of the 
diffracting structure 362a. 
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[0086] In some applications, the film stack that is available for measurement at 

site 1 may not be identical to the film stack associated with the diffracting structure; this 
is illustrated in Fig. 9B. Thus, the film stack 364 measured at site 1 is different from the 
film stack 366 over or below a diffracting structure 368a of the overall structure 368. 
However, as long as there is one common layer between the film stacks 364, 366, 
measurement of the stack 364 may still yield information useful for simpHfying the 
measurement of the diffracting structure 368a. Thus, as shown in Fig. 9B, the two film 
stacks 364, 366 have a substantially identical layer 370. In such event, knowledge of the 
thickness and the complex index or refraction of layer 370 would simplify the calculation 
or regression process in determining the profile or parameters of the diffiracting structure 
368a. This is true even where the two layers 370 in the two stacks 364, 366 are not 
identical, but have substantially the same thickness or the same index of refiraction or 
other optical properties. 

[0087] Figs. 9A, 9B illustrate structures obtained more frequently in lithographic 

processing. During etching processes, frequently only one portion of a layer may be 
etched while leaving another portion of the layer unetched. In such event, measurement 
of the unetched portion may help to simplify the determination of the profile or 
parameters of the diffracting structures in the etched portion. This is illustrated in Fig. 
9C. As shown in Fig. 9C, the unetched portion 372 includes layers 0, 1 and 2. Prior to 
etching, layers 0, 1, 2 of portion 374 are substantially the same as those in portion 372. 
During etching, layer 2 has been etched into diffracting structure 374a of the overall 
structure 374. Therefore, if structure 372 at site 1 is measured by means of the apparatus 
in Fig. 1 A or 2, the information concerning the thicknesses and the indices of refiraction 
of the three layers in structure 372 would greatty reduce the complexity of the calculation 
or optimization process for deriving the profile or parameters of the diffracting structure 
374a. 

[0088] The above-described feature is applicable even where layers 0 and 1 and 

the corresponding layers in structure 374 are not identical, or when layer 2 of structure 
372 does not have the same thickness or index or refraction as the diffracting structure 
374a. As long as there is some common parameter such as thickness or index or 
refraction between any one layer in structure 372 and another layer in the structure 374, 
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measuring structure 372 at site 1 and feeding the information forward to the measurement 
of structure 374 would simplify the second measurement. 

[0089] Fig. 10 illustrates an embodiment where data obtained from a 

scatterometric measurement tool 376 may be fed forward to a CD-SEM measurement tool 
378 performing a measurement on the same target or a target with some common features 
(in the same sense as those discussed above in reference to Fig. 9C). 

[0090] Fig. 10 also illustrates an embodiment where data obtained from a 
scatterometric measurement tool 376 may be fed forward to an overlay measurement tool 
378 performing a measurement on the same target on a target with some common 
features (in the same sense as those discussed above in reference to Fig. 9C). 

[0091] Fig. 1 1 is a schematic view of an overlay tool measuring the offset 

between two gratings (only one grating 382 is shown in Fig. 1 1). A radiation beam 381 is 
reflected by mirrors 385 and focused by lens 384 to the target 382. The scattered or 
diffracted radiation is collected by lens 386 and focused to CCD 387, whose output is 
sent to computer 40 to compute the misalignment between the gratings. Alternatively, 
the target may be a box-in-box type target, a one-dimensional unage of which is shown in 
Fig. 12. The profiles of both types of the target will influence the overlay measurements. 
Thus, as illustrated in Fig. 12, the shape of the edges of a box-in-box type target can have 
an effect on the radiation diffracted by the box-in-box target. Therefore, by measuring 
the profile or other parameters of the target, and feeding this information to the overlay 
measurement tool, the overlay measurement will be more accurate. 

[0092] In Fig. 6C, and the accompanying description above, it is noted that it may 

be possible to select an optimum parameter for the matching process in order to obtain a 
more accurate result for the parameter values. In some applications, the reverse may be 
desirable. Thus, where it is desirable to measure the profile or parameters of a diffracting 
structure that is adjacent to a film structure, but where the thicknesses and indices of 
refraction of the film structure are not readily available or cannot be measured, it may be 
desirable to choose parameters where the influence of the film structures on the 
diffracting structure measurement is minimized. For such applications, essentially the 
same process as that described above in reference to Figs. 6C may be carried out. In 

-27" 



wo 03/054475 



PCT/US02/41151 



contrast to the discussion of Fig. 6C, however, instead of selecting the parameter (e.g. 
film thickness or indices of reflation) of the film structure by which the spectral 
variations are maximized for a given change in the parameter value, the film parameter 
which gives rise to the smallest change in the spectral variations when it is changed is 
selected instead. 

Software Upgrades 

[0093] The invention has been described above, employing a system such as that 
shown in Fig. 1 A, 2 and 11. While the various optical components in the system of Fig. 
1 A 2 and 1 1 are used to obtain measured data fi-om the sample, many of the other 
processes are performed by computer 40 (not shown in Fig. 2 to simplify the figure). 
Thus, for many systems currently being used by manufacturers such as semiconductor 
manufacturers, the computers used in the systems may not have the capability to perform 
the techniques described above. Thus, another aspect of the invention envisions that the 
software in these computers can be upgraded so that computer 40 can perform one or 
more of the above described different fimctions. Therefore, another aspect of the 
invention involves the software components that are loaded to computer 40 to perform 
the above-described fimctions. These fimctions, in conjunction with the optical 
components of system 10 or 80 or 380 in Fig. 1 A or 2 or 1 1, provide results with the 
different advantages outlined above. The software or program components may be 
installed in computer 40 in a variety of ways. 

[0094] As will be understood in the art, the inventive software components may be 
embodied in a fixed media program component containing logic instmctions and/or data 
that when loaded into an appropriately configured computing device to cause that device 
to perfonn according to the invention. As will be understood in the art, a fixed media 
program may be deUvered to a user on a fixed media for loading in a users computer or a 
fixed media program can reside on a remote server that a user accesses through a 
communication medium in order to download a program component. Thus another 
aspect of the invention involves transmitting, or causing to be transmitted, the program 
component to a user where the component, when downloaded into the user's device, can 
perform any one or more of the fimctions described above. 

[0095J Fig. 13 shows an information appliance (or digital device) that may be 
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understood as a logical apparatus that can read instructions from media 417 and/or 
network port 419. Apparatus 40 can thereafter use those instructions to direct server or 
client logic, as understood in the art, to embody aspects of the invention. One type of 
logical apparatus that may embody the invention is a computer system as illustrated in 40, 
containing CPU 404, optional input devices 409 and 41 1, disk drives 415 and optional 
monitor 405. Fixed media 417 may be used to program such a system and may represent 
a disk-type optical or magnetic media, magnetic tape, solid state memory, etc.. One or 
more aspects of the invention may be embodied in whole or in part as software recorded 
on this fixed media. Commxmication port 419 may also be used to initially receive 
instructions that are used to program such a system to perform any one or more of the 
above-described fimctions and may represent any type of communication connection, 
such as to the internet or any other computer network. The instructions or program may 
be transmitted directly to a user*s device or be placed on a network, such as a website of 
the intemet to be accessible through a user's device. All such methods of making the 
program or software component available to users are known to those in the art and will 
not be described here. 

[0096] The invention also may be embodied in whole or in part within the circuitry of 
an application specific integrated circuit (ASIC) or a programmable logic device (PLD). 
In such a case, the invention may be embodied in a computer understandable descriptor 
language which may be used to create an ASIC or PLD that operates as herein described. 

[0097] While the invention has been described ^bove by reference to various 
embodiments, it will be understood that changes and modifications may be made without 
departing from the scope of the invention, which is to be defined only by the appended 
claims and their equivalents. All references referred to herein are incorporated by 
reference in their entireties. 
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WHAT IS CLAIMED IS : 



1 1 . A method for measuring one or more parameters of a diffracting structure 

2 comprising a first material, said structure located adjacent to one or more first film 

3 structures having associated thickness and optical index information, comprising: 

4 measuring data associated with a reference structure, said reference structure 

5 comprising at least one layer that has substantially the same thickness as the diffracting 

6 structure, and/or comprises a second material having substantially the same optical 

7 properties as those of the first material; 

8 directing a beam of electromagnetic radiation at a plurality of wavelengths at said 

9 diffracting structure and the one or more fihn structures; 

1 0 detecting intensity and/or phase data of a diffraction at said pluraUty of 

1 1 wavelengths from said structure of said beam; and 

12 determining said one or more parameters using the data associated with the 

13 reference structure and data detected from said structure. 

1 2. The method of claim 1 , wherein said reference structure is located adjacent 

2 to a film structure comprising layers, at least one of which has substantially the same 

3 thickness as one layer in the one or more first film structures and/or which comprises a 

4 material having substantially the same optical properties as those of a material in the one 

5 or more first film structures, said method further constructing a reference database of the 

6 one or more parameters using thickness and optical index information of the film 

7 structure adjacent to the reference structure, wherein said determining uses the reference 

8 database. 

1 3. The method of claim 2, wherein said constructing constructs a reference 

2 database comprising a plurality of functionSj each of said functions corresponding to a 

3 probable linewidth, height or wall angle of said diffracting structure. 

1 4. The method of claim 2, wherein said constructing constructs a reference 

2 database over a spectrum of wavelengths, said directing directs a beam of broadband 

3 radiation at wavelengths including said spectrum and said detecting detects intensity or 

4 ellipsometric parameter data over said spectrum of wavelengths. 
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1 5. The method of claim 1 , wherein said measuring measures intensity data or 

2 ellipsometric parameters. 

1 6. The method of claim 1 , wherein said detecting detects intensity data or 

2 ellipsometric parameters. 

1 7. The method of claim 1, said parameters including critical dimension, 

2 height and sidewall angle. 

1 8. The method of claim 1 , wherein said one or more first fikn structures have 

2 no periodic diffracting pattern thereon, and the measuring measures by means of a 

3 spectroscopic ellipsometer, a spectrophotometer or a spectroreflectometer. 

1 9. The method of claim 1, wherein said directing and detecting are by means 

2 of a spectroscopic ellipsometer, a spectrophotometer or a spectroreflectometer. 

1 10. The method of claim 1 , further comprising providing infomiation 

2 concerning optical index or indices and fihn thickness(es) of the one or more first film 

3 structures, and constructing a reference database of the one or more parameters related to 

4 the diffracting structure using said optical index and film thickness information of the one 

5 or more first film structures, wherein said determining uses the reference database. 

1 11. The method of claim 10, wherein said constructing constructs a reference 

2 database comprising a plurality of functions, each of said functions corresponding to a 

3 probable linewidth, height or wall angle of said diffracting structure. 

1 12. The method of claim 10, wherein said constructing constructs a reference 

2 database over a spectrum of wavelengths, and said directing directs a beam of broadband 

3 radiation at wavelengths including said spectrum and said detecting detects intensity or 

4 ellipsometric parameter data over said spectrum of wavelengths. 

1 13. The method of claim 1 , wherein said detecting detects a zeroth order 

2 diffraction of said beam from said diffiacting structure. 

1 14. The method of claim 1, wherein said directing directs polarized radiation 

2 to the diffracting portion. — 
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1 15. The method of claim 1, said plurality of wavelengths including ultraviolet 

2 wavelengths. 

1 16. A method for measuring one or more parameters of a first periodic 

2 diffracting structure of a sample, comprising: 

3 performing scatterometric measurements on a second reference diffracting 

4 structures to obtain intensity or phase data; 

5 performing scatterometric measurements on the first diffracting structure to obtain 

6 intensity or phase data; and 

7 obtaining the one or more parameters on the first diffracting structure using results 

8 from the measurements on the second diffracting structure. 

1 17. The method of claim 16, wherein one or both of said scatterometric 

2 measurements performed measure ellipsometric parameters. 

1 18. The method of claim 16, wherein both diffracting structures are of the 

2 same sample, so that both scatterometric measurements are performed on the sample. 

1 19. The method of claim 1 6, said scatterometric measurements on the second 

2 reference diffracting structure yielding a profile of such structure, wherein said obtaining 

3 comprises an optimization process employing the profile of the second diffracting 

4 structure as a seed profile. 

1 20. The method of claim 1 9, said optimization process comprising a 

2 regression process. 

1 2 1 . A method for measuring one or more parameters of a sample having one 

2 or more periodic diffracting structures thereon, comprising: 

3 performing scatterometric measurements on a first one of the diffracting structures 

4 to obtain intensity or ellipsometric data; 

5 • performing SEM measurements on a second one of the diffracting structures to 

6 obtam critical dimension or profile data, the second one being the same or different from 

7 the furst diffracting structure; and 
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8 obtaining the one or more parameters on the second diffracting structure using 

9 results from the measurements on the first diffracting structure. 

10 22. The method of claim 21, wherein said obtaining obtains an absolute 

1 1 calibration of pitch of the second diffracting structure. 

1 23. A method for measuring one or more parameters of a sample having a 

2 plurality of periodic diffracting structures ttiereon, comprising: 

3 perfonning scatterometric measurements on a first one of the diffracting structures 

4 to obtain intensity or ellipsometric data; 

5 performing overlay measurements on a pair of the diffracting structures or lines or 

6 bars or boxes useful for deriving misalignment information between the pair; and 

7 deriving the misalignment information between the pair from result of the overlay 

8 measurements wherein a result of scatterometric measurements is employed to derive the 

9 misalignment information. 

1 24. The method of claim 23, wherein the result of scatterometric 

2 measurements comprises critical dimension, height, sidewall angle or profile of the first 

3 diffracting stmcture, 

1 25 . A method for measuring one or more parameters of a periodic diffracting 

2 structure with or with-out an adjacent film structure having associated thickness and 

3 optical index information, comprising: 

4 providing a profile type for the periodic diffracting structure and the film 



5 stmcture, said profile type associated with one or more parameters related to the periodic 

6 diffracting stmcture and information related to the film structure, said profile type also 

7 associated with a plurality of sets of radiation data of different radiation parameters, said 

8 radiation parameters including reflectance or transmittance parameters and ellipsometric 

9 parameters; 



10 selecting at least one set of radiation data from the sets of radiation data of 

1 1 different parameters associated with the profile type based on sensitivity of such data to a 

12 change in the information associated with the film structure as derived from the film 

13 stmcture or the periodic diffracting structure; 
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14 detecting radiation data from the periodic diffracting structure; and 

15 comparing the detected radiation data to the set selected to arrive at a set of 

16 value(s) of the one or more parameters. 

1 26. The method of claim 25, wherein said providing comprises: 

2 supplying a gallery of a plurality of profile types, each profile type associated with 

3 one or more parameters related to the periodic diffracting structure and information 

4 associated with the film structure and associated with a set of radiation data, wherein at 

5 least one of said profile types provided is associated with a plurality of sets of radiation 

6 data of different radiation parameters, said radiation parameters including reflectance or 

7 transmittance parameters and ellipsometric parameters; and 

8 selecting a profile type from the gallery. 

1 27. The method of claim 25, wherein said selecting selects the at least one set 

2 of radiation data based on a criterion that the selected at least one set is less sensitive than 

3 the non-selected sets to a change in the information associated with the film structure. 
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Figure &B Flowchart of Diffraction Solver 
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Figure . Selection of the optimal signal for matching 
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